On-Demand Source Code Generation & Scheduling Optimised Parallel Applications on Heterogeneous Platforms
نویسنده
چکیده
Scheduling applications tasks across heterogeneous clusters is a growing problem, particularly when new upgraded components are added to a parallel computing system that may have originally been homogeneous. We describe how automatic and just-in-time source code generation techniques can be used to make the best parallel decomposition for whatever resource is available in a heterogeneous system consisting of graphical processing unit accelerators and multi-cored conventional CPUs. We show how a high level domain specific language approach to our set of target simulation applications can be used to cater for a variety of different GPU and CPU models and scheduling circumstances. We present some performance and resource utilisation data illustrating the scheduling issue for heterogeneous systems in computational science. We discuss the future outlook for this code generation approach in software engineering.
منابع مشابه
From Heterogeneous Task Scheduling to Heterogeneous Mixed Parallel Scheduling
Mixed-parallelism, the combination of dataand taskparallelism, is a powerful way of increasing the scalability of entire classes of parallel applications on platforms comprising multiple compute clusters. While multi-cluster platforms are predominantly heterogeneous, previous work on mixed-parallel application scheduling targets only homogeneous platforms. In this paper we develop a method for ...
متن کاملTopology-aware Optimization of Communications for Parallel Matrix Multiplication on Hierarchical Heterogeneous HPC Platforms
Communications on hierarchical heterogeneous HPC platforms can be optimized based on topology information. For MPI, as a major programming tool for such platforms, a number of topology-aware implementations of collective operations have been proposed for optimal scheduling of messages. This approach improves communication performance and does not require to modify application source code. Howev...
متن کاملDesign and initial performance of a high-level unstructured mesh framework on heterogeneous parallel systems
OP2 is a high-level domain specific library framework for the solution of unstructured mesh-based applications. It utilizes source-to-source translation and compilation so that a single application code written using the OP2 API can be transformed into multiple parallel implementations for execution on a range of back-end hardware platforms. In this paper we present the design and performance o...
متن کاملParallelizing Existing Applications in a Distributed Heterogeneous Environment
Applications based upon the nite element method are well known for their demand for computational resources. An eeective method for satisfying this demand is heterogeneous parallel computing. This paper presents the results obtained by applying heterogeneous computing to a large, existing nite element application code: CSTEM. A diicult problem associated with heterogeneous computing is the mapp...
متن کاملAuto Source Code Generation and Run-Time Infrastructure and Environment for High Performance, Distributed Computing Systems
With the emergence of inexpensive commercial off-the-shelf (COTS) parts, heterogeneous multi-processor HPC platforms have now become more affordable. However, the effort required in developing real-time applications that require high-performance and high input/output bandwidth for the HPC systems is still difficult. Honeywell Inc. has released a suite of tools called the Systems and Application...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013